Perceptual processing of audiovisual Lombard speech
Abstract
Seeing the talker improves the intelligibility of speech degraded by noise (a visual speech enhancement effect). This experiment examined whether this enhancement is greater when the speech signals are recorded in noise than when they are recorded in quiet. Ten sentences were spoken by four talkers either in quiet or whilst they were listening to cocktail party noise (in noise). The visual speech of these talkers was measured using 24 facial markers, and associated video clips were filmed. Sixty participants were tested in a speech-in-noise identification experiment under four different conditions. Visual speech enhancement was measured as the mean percentage of words correctly identified in the audiovisual condition minus that in the auditory-only condition. After ceiling effects were curtailed, the results showed that the audiovisual enhancement for speech signals recorded in noise (43%) was significantly greater than that for speech recorded in quiet (30%).
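The enhancement measure described in the abstract is a simple difference of means, which a minimal sketch can make concrete. The function name `visual_enhancement` and the score lists below are hypothetical illustrations, not the authors' code or data, and the sketch assumes scores are expressed as percent words correct per participant.

```python
from statistics import mean


def visual_enhancement(av_scores, ao_scores):
    """Return mean audiovisual (AV) percent correct minus mean
    auditory-only (AO) percent correct.

    Both arguments are sequences of percent-words-correct scores.
    This is an assumed reading of the measure in the abstract.
    """
    return mean(av_scores) - mean(ao_scores)


# Made-up scores for illustration only (NOT data from the study).
av = [80.0, 70.0, 90.0]  # audiovisual condition
ao = [40.0, 50.0, 30.0]  # auditory-only condition

print(visual_enhancement(av, ao))  # prints 40.0
```

Under this reading, the comparison reported in the abstract amounts to computing this difference separately for the in-noise and in-quiet recordings and comparing the two values.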
Related resources
Audiovisual processing of Lombard speech
Perception results are presented that address the role of Lombard speech in auditory and audiovisual speech perception. Basically, visual enhancement neutralizes the advantage of Lombard speech observed for auditory perception. It remains an open question whether or not Lombard speech is preferable for perception studies of speech in noise.
Investigating the role of the Lombard reflex in visual and audiovisual speech recognition
This study focuses on the analysis of the Lombard effect in visual and audiovisual speech recognition. Previous studies have shown that the performance of an audio-only automatic speech recognizer decreases in noisy environments because of the Lombard reflex. A few studies have considered the visual changes due to the Lombard reflex, but the role of the Lombard reflex in automatic visual speech...
Audiovisual Lombard speech: reconciling production and perception
An earlier study compared audiovisual perception of speech 'produced in environmental noise' (Lombard speech) and speech 'produced in quiet' with the same environmental noise added. The results showed that listeners make differential use of the visual information depending on the recording condition, but gave no indication of how or why this might be so. A possible confound in that study wa...
Towards a lexical fuzzy logical model of perception: the time-course of audiovisual speech processing in word identification
This study investigates the time-course of information processing in both visual and auditory speech as used for word identification in face-to-face communication. It extends the limited previous research on this topic and provides a valuable database for future research in audiovisual speech perception. An evaluation of models of speech perception by ear and eye in their ability ...
THE ROLE OF FACIAL GESTURAL INFORMATION IN SUPPORTING PERCEPTUAL LEARNING OF DEGRADED SPEECH